NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Data Attribution for Text-to-Image Models by Unlearning Synthesized Images

Wang, Sheng-Yu; Hertzmann, Aaron; Efros, Alexei; Zhu, Jun-Yan; Zhang, Richard (December 2024, 2024 Conference on Neural Information Processing Systems)

Full Text Available
Rethinking Score Distillation as a Bridge Between Image Distributions

https://doi.org/10.52202/079017-1064

McAllister, David; Ge, Songwei; Huang, Jia-Bin; Jacobs, David; Efros, Alexei; Holynski, Aleksander; Kanazawa, Angjoo (December 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

Score distillation sampling (SDS) has proven to be an important tool, enabling the use of large-scale diffusion priors for tasks operating in data-poor domains. Unfortunately, SDS has a number of characteristic artifacts that limit its usefulness in general-purpose applications. In this paper, we make progress toward understanding the behavior of SDS and its variants by viewing them as solving an optimal-cost transport path from a source distribution to a target distribution. Under this new interpretation, these methods seek to transport corrupted images (source) to the natural image distribution (target). We argue that current methods’ characteristic artifacts are caused by (1) linear approximation of the optimal path and (2) poor estimates of the source distribution. We show that calibrating the text conditioning of the source distribution can produce high-quality generation and translation results with little extra overhead. Our method can be easily applied across many domains, matching or beating the performance of specialized methods. We demonstrate its utility in text-to-2D, text-based NeRF optimization, translating paintings to real images, optical illusion generation, and 3D sketch-to-real. We compare our method to existing approaches for score distillation sampling and show that it can produce high-frequency details with realistic colors.
more » « less
Full Text Available
Interpreting CLIP's Image Representation via Text-Based Decomposition

Gandelsman, Yossi; Efros, Alexei A; Steinhardt, Jacob (January 2024, ICLR 2024)

Full Text Available
Space-Time Correspondence as a Contrastive Random Walk

Jabri, Allan; Owens, Andrew; Efros, Alexei (December 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
Few-shot Image Generation via Cross-domain Correspondence

https://doi.org/10.1109/cvpr46437.2021.01060

Ojha, Utkarsh; Li, Yijun; Lu, Jingwan; Efros, Alexei A.; Lee, Yong Jae; Shechtman, Eli; Zhang, Richard (June 2021, Conference on Computer Vision and Pattern Recognition (CVPR))

Training generative models, such as GANs, on a target domain containing limited examples (e.g., 10) can easily result in overfitting. In this work, we seek to utilize a large source domain for pretraining and transfer the diversity information from source to target. We propose to preserve the relative similarities and differences between instances in the source via a novel cross-domain distance consistency loss. To further reduce overfitting, we present an anchor-based strategy to encourage different levels of realism over different regions in the latent space. With extensive results in both photorealistic and non-photorealistic domains, we demonstrate qualitatively and quantitatively that our few-shot model automatically discovers correspondences between source and target domains and generates more diverse and realistic images than previous methods.
more » « less
Full Text Available
RANSAC-Flow: Generic Two-Stage Image Alignment

https://doi.org/10.1007/978-3-030-58548-8_36

Shen, Xi; Darmon, François; Efros, Alexei; Aubry, Mathieu (October 2020, Computer Vision – ECCV 2020. ECCV 2020. Lecture Notes in Computer Science)
Vedaldi A., Bischof H. (Ed.)
Full Text Available
Contrastive Learning for Unpaired Image-to-Image Translation

https://doi.org/10.1007/978-3-030-58545-7_19

Park, Taesung; Efros, Alexei; Zhang, Richard; Zhu, Jun-Yan (October 2020, Computer Vision – ECCV 2020. ECCV 2020. Lecture Notes in Computer Science,)
Vedaldi A., Bischof H. (Ed.)
Full Text Available
Learning to Factorize and Relight a City

https://doi.org/10.1007/978-3-030-58548-8_32

Liu, Andrew; Ginosar, Shiry; Zhou, Tinghui; Efros, Alexei; Snavely, Noah (October 2020, Computer Vision – ECCV 2020. ECCV 2020. Lecture Notes in Computer Science)

Full Text Available
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

Sun, Yu; Wang, Xiaolong; Liu, Zhuang; Miller, John; Efros, Alexei A.; Hardt, Moritz (April 2020, ICML 2020)
null (Ed.)
In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised learning problem, on which we update the model parameters before making a prediction. This also extends naturally to data in an online stream. Our simple approach leads to improvements on diverse image classification benchmarks aimed at evaluating robustness to distribution shifts.
more » « less
Full Text Available
Everybody Dance Now

https://doi.org/10.1109/ICCV.2019.00603

Chan, Caroline; Ginosar, Shiry; Zhou, Tinghui; Efros, Alexei (October 2019, International Conference on Computer Vision (ICCV))
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records